Large Deviations of Extreme Eigenvalues of Random Matrices 
We calculate analytically the probability of large deviations from its mean of the largest (smallest) eigenvalue of random matrices belonging to the Gaussian orthogonal, unitary and symplectic ensembles. In particular, we show that the probability that all the eigenvalues of an $(N \times N)$ random matrix are positive (negative) decreases for large $N$ as $\sim \exp[-\beta\,\theta(0)\,N^2]$, where the parameter $\beta$ characterizes the ensemble and the exponent $\theta(0) = (\ln 3)/4 = 0.274653\ldots$ is universal. We also calculate exactly the average density of states in matrices whose eigenvalues are restricted to be larger than a fixed number $\zeta$, thus generalizing the celebrated Wigner semi-circle law. The density of states generically exhibits an inverse square-root singularity at $\zeta$.

Studies of the statistics of the eigenvalues of random matrices have a long history going back to the seminal work of Wigner [1]. Since then, random matrices have found applications in multiple fields including nuclear physics, quantum chaos, disordered systems, string theory and number theory [2]. Three classes of matrices with Gaussian entries have played important roles [3]: $(N \times N)$ real symmetric (Gaussian Orthogonal Ensemble (GOE)), $(N \times N)$ complex Hermitian (Gaussian Unitary Ensemble (GUE)) and $(2N \times 2N)$ self-dual Hermitian matrices (Gaussian Symplectic Ensemble (GSE)). A central result in the theory of random matrices is the celebrated Wigner semi-circle law. It states that for large $N$ and on average, the $N$ eigenvalues (suitably scaled) lie within a finite interval $[-\sqrt{2N}, \sqrt{2N}]$, often referred to as the Wigner 'sea'. Within this sea, the average density of states has a semi-circular form (see Fig. 1) that vanishes at the two edges $-\sqrt{2N}$ and $\sqrt{2N}$,

$$\rho_{sc}(\lambda, N) = \sqrt{\frac{2}{N\pi^2}}\left[1 - \frac{\lambda^2}{2N}\right]^{1/2}. \tag{1}$$

FIG. 1: The dashed line shows the semi-circular form of the average density of states. The largest eigenvalue is centered around its mean $\sqrt{2N}$ and fluctuates over a scale of width $N^{-1/6}$. The probability of fluctuations on this scale is described by the Tracy-Widom distribution (shown schematically).
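As a quick numerical illustration of the semi-circle law (a sketch, not part of the original analysis), one can diagonalize a single large GOE matrix and compare the eigenvalue histogram with the semi-circle. The normalization is an assumption chosen so that the edges sit at $\pm\sqrt{2N}$ as in the text: diagonal entries of unit variance and off-diagonal entries of variance 1/2. The helper names `sample_goe` and `semicircle_density` are hypothetical.

```python
import numpy as np

def sample_goe(n, rng):
    """Real symmetric matrix with Var(H_ii) = 1 and Var(H_ij) = 1/2 (assumed normalization)."""
    a = rng.standard_normal((n, n))
    return (a + a.T) / 2.0

def semicircle_density(lam, n):
    """Semi-circle with edges at +-sqrt(2N), normalized to unit area."""
    inside = np.clip(2.0 * n - lam**2, 0.0, None)
    return np.sqrt(inside) / (np.pi * n)

rng = np.random.default_rng(0)
n = 400
eigs = np.linalg.eigvalsh(sample_goe(n, rng))
edge = np.sqrt(2.0 * n)

# Compare a normalized histogram of the eigenvalues with the semi-circle.
hist, bins = np.histogram(eigs, bins=20, range=(-edge, edge), density=True)
centers = 0.5 * (bins[1:] + bins[:-1])
max_abs_dev = np.max(np.abs(hist - semicircle_density(centers, n)))

print("largest eigenvalue / sqrt(2N):", eigs.max() / edge)
print("max |histogram - semicircle| :", max_abs_dev)
```

For a single matrix of this size the largest eigenvalue should already sit close to $\sqrt{2N}$, and the histogram should follow the semi-circle up to binning effects near the edges.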



Thus, the average of the maximum (minimum) eigenvalue is $\sqrt{2N}$ ($-\sqrt{2N}$). However, for finite but large $N$, the maximum eigenvalue fluctuates around its mean $\sqrt{2N}$ from one sample to another. Relatively recently Tracy and Widom [3] proved that these fluctuations typically occur over a narrow scale of $\sim O(N^{-1/6})$ around the upper edge $\sqrt{2N}$ of the Wigner sea (see Fig. 1). More precisely, they showed [3] that asymptotically for large $N$, the scaling variable $\xi = \sqrt{2}\,N^{1/6}\,[\lambda_{\max} - \sqrt{2N}]$ has a limiting $N$-independent probability distribution, $\mathrm{Prob}[\xi \le x] = F_\beta(x)$, whose form depends on the value of the parameter $\beta = 1$, 2 and 4 characterizing respectively the GOE, GUE and GSE. The function $F_\beta(x)$, computed as a solution of a nonlinear differential equation [3], approaches 1 as $x \to \infty$ and decays rapidly to zero as $x \to -\infty$. For example, for $\beta = 2$, $F_2(x)$ has the following tails [5],

$$F_2(x) \to \begin{cases} 1 - O\!\left(\exp[-4x^{3/2}/3]\right) & \text{as } x \to \infty \\ \exp[-|x|^3/12] & \text{as } x \to -\infty \end{cases} \tag{2}$$

The probability density function $dF_\beta/dx$ thus has highly asymmetric tails. The distribution of the minimum eigenvalue simply follows from the fact that $\mathrm{Prob}[\lambda_{\min} \ge \zeta] = \mathrm{Prob}[\lambda_{\max} \le -\zeta]$. Amazingly, the Tracy-Widom distribution has since emerged in a number of seemingly unrelated problems such as the longest increasing subsequence problem [4], directed polymers in (1 + 1)-dimensions [5], various (1 + 1)-dimensional growth models [6], a class of sequence alignment problems [7] and in finance [8]. Recently, it has been shown that the statistics of the largest eigenvalue is also of importance in population growth of organisms in fluctuating environments [9].
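The $N^{-1/6}$ scaling can be probed numerically. The following is a minimal sketch, assuming the same GOE normalization as the edges $\pm\sqrt{2N}$ require (unit-variance diagonal, variance-1/2 off-diagonal). It samples the scaling variable $\xi$ defined above for a few matrix sizes; the function name `scaled_largest_eigenvalue` and the sample sizes are illustrative choices, not taken from the paper.

```python
import numpy as np

def scaled_largest_eigenvalue(n, rng):
    """Return xi = sqrt(2) * N^(1/6) * (lambda_max - sqrt(2N)) for one GOE sample."""
    a = rng.standard_normal((n, n))
    h = (a + a.T) / 2.0                      # assumed normalization, edges at +-sqrt(2N)
    lam_max = np.linalg.eigvalsh(h)[-1]      # eigenvalues come back in ascending order
    return np.sqrt(2.0) * n ** (1.0 / 6.0) * (lam_max - np.sqrt(2.0 * n))

rng = np.random.default_rng(1)
xi = {n: np.array([scaled_largest_eigenvalue(n, rng) for _ in range(200)])
      for n in (50, 100, 200)}

for n, values in xi.items():
    print(f"N = {n:4d}   mean(xi) = {values.mean():+.3f}   std(xi) = {values.std():.3f}")
```

If the scaling holds, the mean and the spread of $\xi$ should be of order one and depend only weakly on $N$, up to finite-size corrections and sampling noise.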

The Tracy-Widom distribution describes the probability of typical and small fluctuations of $\lambda_{\max}$ over a very narrow region of width $\sim O(N^{-1/6})$ around the mean $\langle\lambda_{\max}\rangle \approx \sqrt{2N}$. A natural question is how to describe the probability of atypical and large fluctuations of $\lambda_{\max}$ around its mean, say over a wider region of width $\sim O(N^{1/2})$. For example, what is the probability that all the eigenvalues of a random matrix are negative (or equivalently all are positive)? This is the same as the probability that $\lambda_{\max} \le 0$ (or equivalently $\lambda_{\min} \ge 0$). Since $\langle\lambda_{\max}\rangle \approx \sqrt{2N}$, this requires the computation of the probability of an extremely rare event characterizing a large deviation of $\sim -O(N^{1/2})$ to the left of the mean. This question recently came up in the context of random landscape models of anthropic principle based string theory [10, 11] as well as in quantum cosmology [12]. Here one is interested in the statistical properties of vacua associated with a random multifield potential, e.g., how many minima are there in a random string landscape? Similar questions also arise in disordered systems where one is interested in counting the number of local minima of a random Gaussian field [13]. In order to have a local minimum of the random landscape one needs to ensure that the eigenvalues of the associated random Hessian matrix are all positive. A related important question is: if one conditions all the eigenvalues to be positive, how does the average density of states get modified from the Wigner semi-circle form? In this Letter, we address these issues analytically.
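To get a feeling for how rare the event "all eigenvalues positive" is, one can estimate its probability by brute force for very small matrices. This is a sketch under the assumed GOE normalization (unit-variance diagonal, variance-1/2 off-diagonal); the function name and sample sizes are illustrative. For $N = 2$ and this normalization the probability can be worked out by hand and equals $(2 - \sqrt{2})/4 \approx 0.1464$, which gives a simple check of the sampler.

```python
import numpy as np

def prob_all_positive(n, n_samples, rng):
    """Fraction of sampled GOE matrices (beta = 1) whose eigenvalues are all positive."""
    a = rng.standard_normal((n_samples, n, n))
    h = (a + np.swapaxes(a, 1, 2)) / 2.0      # batch of real symmetric matrices
    smallest = np.linalg.eigvalsh(h)[:, 0]    # ascending order: column 0 is lambda_min
    return np.mean(smallest > 0.0)

rng = np.random.default_rng(2)
estimates = {n: prob_all_positive(n, 200_000, rng) for n in (1, 2, 3)}
exact_n2 = (2.0 - np.sqrt(2.0)) / 4.0         # elementary 2x2 result for this normalization

for n, p in estimates.items():
    print(f"N = {n}: estimated probability = {p:.4f}")
print(f"N = 2 exact value         = {exact_n2:.4f}")
```

The estimate drops quickly with $N$, which is why direct sampling of this kind becomes impractical beyond a handful of matrix sizes.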

It is useful to summarize our main results. In Ref. [11], it was shown numerically that the probability that all the eigenvalues of an $(N \times N)$ GOE matrix ($\beta = 1$) are positive (or equivalently all the eigenvalues are negative, i.e., $\lambda_{\max} \le 0$) decreases rapidly with large $N$ as $\mathrm{Prob}[\lambda_{\max} \le 0] \sim \exp[-\theta(0)N^2]$. A crude approximate argument was provided for the exponent $\theta(0) \approx 1/4$ [11], along with numerical simulations. Here we show exactly that for all ensembles characterized by the parameter $\beta$,

$$\theta(0) = \beta\,\frac{\ln 3}{4} = (0.274653\ldots)\,\beta. \tag{3}$$

More generally we calculate the exact large deviation function associated with large fluctuations of $\sim -O(N^{1/2})$ of $\lambda_{\max}$ to the left of its mean value $\sqrt{2N}$. We show that for large $N$ and for all ensembles

$$\mathrm{Prob}[\lambda_{\max} \le t, N] \sim \exp\left[-\beta N^2\,\Phi\!\left(\frac{\sqrt{2N} - t}{\sqrt{N}}\right)\right] \tag{4}$$

where $t \sim O(N^{1/2}) \le \sqrt{2N}$ is located deep inside the Wigner sea. The large deviation function $\Phi(y)$ is zero for $y \le 0$, but is nontrivial for $y > 0$, where we compute it exactly. For small deviations to the left of the mean, taking the $y \to 0$ limit of $\Phi(y)$, we recover the left tail of the Tracy-Widom distribution as in Eq. (2). Thus our result for large deviations of $\sim -O(N^{1/2})$ to the left of the mean is complementary to the Tracy-Widom result for small fluctuations of $\sim -O(N^{-1/6})$, and the two solutions match smoothly. In the process, we also calculate exactly the modified average density of states when all the eigenvalues are constrained to be on the right of a barrier, say at $\lambda = \zeta$, thus generalizing Wigner's semi-circle law.

Our starting point is the celebrated result due to Wigner for the joint probability density function (pdf) of the eigenvalues of a random $(N \times N)$ matrix [1]

$$P(\{\lambda_i\}) = B_N \exp\left[-\frac{\beta}{2}\left(\sum_{i=1}^{N}\lambda_i^2 - \sum_{i \ne j}\ln|\lambda_i - \lambda_j|\right)\right] \tag{5}$$

where $B_N$ normalizes the pdf and $\beta = 1$, 2 and 4 correspond respectively to the GOE, GUE and GSE. The joint law allows one to interpret the eigenvalues as the positions of charged particles, repelling each other via a 2-d Coulomb potential (logarithmic); they are confined on a 1-d line and each is subject to an external harmonic potential. The parameter $\beta$ that characterizes the type of ensemble can be interpreted as the inverse temperature. The average density of states $\rho_{sc}(\lambda, N) = \sum_{i=1}^{N}\langle\delta(\lambda - \lambda_i)\rangle/N$ can be calculated [2] from the joint pdf in Eq. (5) and has the Wigner semi-circular form of Eq. (1). In the Coulomb gas language, this is the average equilibrium charge density.
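The Coulomb gas reading of Eq. (5) can be made concrete with a few lines of code. The sketch below writes the joint pdf as $P \propto \exp(-\beta E)$ with $E = \frac{1}{2}\left(\sum_i \lambda_i^2 - \sum_{i \ne j}\ln|\lambda_i - \lambda_j|\right)$; the function names are hypothetical and the normalization constant $B_N$ is left out.

```python
import math

def coulomb_gas_energy(eigenvalues):
    """E = (1/2) * (sum_i lam_i^2 - sum_{i != j} ln|lam_i - lam_j|)."""
    confinement = sum(lam * lam for lam in eigenvalues)   # harmonic potential
    repulsion = 0.0                                       # logarithmic pair repulsion
    for i, lam_i in enumerate(eigenvalues):
        for j, lam_j in enumerate(eigenvalues):
            if i != j:
                repulsion += math.log(abs(lam_i - lam_j))
    return 0.5 * (confinement - repulsion)

def log_weight(eigenvalues, beta):
    """Unnormalized log of the joint pdf in Eq. (5), i.e. ln(P / B_N)."""
    return -beta * coulomb_gas_energy(eigenvalues)

# Two charges at -1 and +1: E = (1/2) * (2 - 2 ln 2) = 1 - ln 2.
print(coulomb_gas_energy([-1.0, 1.0]))

# Nearly coincident charges are strongly suppressed compared with well separated ones.
print(log_weight([0.2, 0.21], beta=2.0), log_weight([0.2, 1.2], beta=2.0))
```

Larger $\beta$ plays the role of a lower temperature: it sharpens the weight around low-energy configurations, where the confinement and the repulsion balance.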

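Here we are interested in the probability $Q_N(\zeta)$ that all the eigenvalues are bigger than, say, $\zeta$, i.e., the probability that all charges lie to the right of the barrier at $\zeta$. Note that, due to the $\lambda \to -\lambda$ symmetry of the pdf in Eq. (5), this is also the probability that all eigenvalues are less than $-\zeta$, i.e., the probability that $\lambda_{\max} \le -\zeta$. Let us first define the restricted partition function

$$Z_N(\zeta) = \int_\zeta^\infty \cdots \int_\zeta^\infty \left(\prod_{i=1}^{N} d\lambda_i\right) \exp\left[-\frac{\beta}{2}\left(\sum_{i=1}^{N}\lambda_i^2 - \sum_{i \ne j}\ln|\lambda_i - \lambda_j|\right)\right]. \tag{6}$$

It then follows that

$$Q_N(\zeta) = \frac{Z_N(\zeta)}{Z_N(-\infty)}. \tag{7}$$

Let $\rho_N(\lambda) = \sum_{i=1}^{N}\delta(\lambda - \lambda_i)/N$ denote the spatial density of charges. Using standard techniques of functional integration we may express $Z_N(\zeta)$ as [14]

$$Z_N(\zeta) \propto \int \mathcal{D}[\rho_N] \exp\left\{-\beta\left[\frac{N}{2}\int_\zeta^\infty d\lambda\,\rho_N(\lambda)\,\lambda^2 - \frac{N^2}{2}\int_\zeta^\infty\!\!\int_\zeta^\infty d\lambda\,d\lambda'\,\rho_N(\lambda)\,\rho_N(\lambda')\ln(|\lambda - \lambda'|)\right] - N\int_\zeta^\infty d\lambda\,\rho_N(\lambda)\ln(\rho_N(\lambda))\right\} \tag{8}$$

where the first two terms represent the energy of the charges as in Eq. (6). The third term represents the entropy, which has a mean field form due to the fact that all charges interact with each other via the long-range logarithmic potential. The charge density $\rho_N(\lambda)$ evidently satisfies the constraints: $\rho_N(\lambda) = 0$ for $\lambda < \zeta$ and $\int_\zeta^\infty d\lambda\,\rho_N(\lambda) = 1$.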

Since we are interested in fluctuations of $\sim O(N^{1/2})$, it is convenient to work with the rescaled variables $\lambda = \mu\sqrt{N}$ and $\zeta = z\sqrt{N}$. It is reasonable to assume that the charge density scales as $\rho_N(\lambda) = N^{-1/2} f(\lambda N^{-1/2})$. The scaling function evidently satisfies the constraints:

$$\int_z^\infty f(\mu)\,d\mu = 1; \qquad f(\mu) = 0 \ \ \text{for}\ \ \mu < z. \tag{9}$$

Expressing the action in Eq. (8) in terms of the rescaled charge density $f(\mu)$, one finds that the energy term scales as $\sim O(N^2)$ whereas the entropy term $\sim O(N)$ is subdominant for large $N$. For large $N$, the functional integration can be carried out using the method of steepest descent. This gives, as a function of the rescaled variable $z = \zeta/\sqrt{N}$,

$$Z_N(z) \propto \exp\left[\beta N^2 S(z) + O(N)\right] \tag{10}$$

where $S(z) = \max_f\{\Sigma(f)\}$ and

$$\Sigma(f) = -\frac{1}{2}\int_z^\infty d\mu\,\mu^2 f(\mu) + \frac{1}{2}\int_z^\infty\!\!\int_z^\infty d\mu\,d\mu'\,f(\mu)\,f(\mu')\ln(|\mu - \mu'|). \tag{11}$$

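The power counting behind Eqs. (10) and (11) can be spelled out term by term. The following is a sketch of that step, obtained by substituting $\lambda = \mu\sqrt{N}$, $\zeta = z\sqrt{N}$ and $\rho_N(\lambda) = N^{-1/2} f(\lambda N^{-1/2})$ into the three terms of Eq. (8) and using $\int_z^\infty f(\mu)\,d\mu = 1$; the additive $\ln N$ constants are not written out in the text and are included here only for completeness.

```latex
\begin{align*}
\frac{\beta N}{2}\int_\zeta^\infty d\lambda\,\rho_N(\lambda)\,\lambda^2
  &= \frac{\beta N^2}{2}\int_z^\infty d\mu\,\mu^2 f(\mu), \\
\frac{\beta N^2}{2}\int_\zeta^\infty\!\!\int_\zeta^\infty d\lambda\,d\lambda'\,
  \rho_N(\lambda)\,\rho_N(\lambda')\ln|\lambda-\lambda'|
  &= \frac{\beta N^2}{2}\int_z^\infty\!\!\int_z^\infty d\mu\,d\mu'\,
  f(\mu)\,f(\mu')\ln|\mu-\mu'| + \frac{\beta N^2}{4}\ln N, \\
N\int_\zeta^\infty d\lambda\,\rho_N(\lambda)\ln\rho_N(\lambda)
  &= N\int_z^\infty d\mu\,f(\mu)\ln f(\mu) - \frac{N}{2}\ln N .
\end{align*}
```

The two energy terms are of order $N^2$ and combine into $\beta N^2\,\Sigma(f)$, while the entropy term is of order $N$. The $\ln N$ constants do not depend on $z$, so they cancel in any ratio of restricted partition functions taken at two barrier positions.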
The stationarity condition $\delta\Sigma(f)/\delta f = 0$ gives

$$\frac{\mu^2}{2} + C = \int_z^\infty f(\mu')\ln(|\mu - \mu'|)\,d\mu' \tag{12}$$

where $C$ is a Lagrange multiplier enforcing the normalization of $f$ in Eq. (9). Differentiating Eq. (12) with respect to $\mu$ gives

$$\mu = \mathcal{P}\int_z^\infty \frac{f(\mu')\,d\mu'}{\mu - \mu'} \tag{13}$$

where $\mathcal{P}$ indicates the Cauchy principal part. It is convenient to introduce a shift $\mu = z + x$, where $x \ge 0$ represents the distance from the barrier (to the right) at $z$. In terms of the variable $x$, Eq. (13) becomes an integral equation for the charge density

$$x + z = \mathcal{P}\int_0^\infty dx'\,\frac{f(x')}{x - x'} \tag{14}$$


where the rhs represents a semi-infinite Hilbert transform. The real technical challenge is to invert this integral equation and obtain a closed form expression for the rescaled charge density $f(x)$. Fortunately this can be done [14]. We find that $f(x)$ is nonzero inside a finite box $x \in [0, L(z)]$ and vanishes outside this box. For $0 \le x \le L(z)$, the density is given exactly by

$$f(x) = \frac{1}{2\pi\sqrt{x}}\,\sqrt{L(z) - x}\,\left[L(z) + 2x + 2z\right]. \tag{15}$$

The length of the box $L(z)$ can be determined from the normalization condition in Eq. (9) and is given by

$$L(z) = \frac{2}{3}\left[\sqrt{z^2 + 6} - z\right]. \tag{16}$$

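Eqs. (15) and (16) are easy to check numerically. The sketch below (function names are hypothetical) evaluates $L(z)$ and $f(x)$ and integrates the density over the box. The substitution $x = L\sin^2\varphi$ is a choice made here to remove the integrable $x^{-1/2}$ singularity at the barrier before applying a plain midpoint rule.

```python
import math

def box_length(z):
    """L(z) from Eq. (16); intended for z >= -sqrt(2)."""
    return (2.0 / 3.0) * (math.sqrt(z * z + 6.0) - z)

def density(x, z):
    """Rescaled charge density f(x) of Eq. (15) at distance x from the barrier."""
    length = box_length(z)
    if x <= 0.0 or x >= length:
        return 0.0
    return (math.sqrt(length - x) * (length + 2.0 * x + 2.0 * z)
            / (2.0 * math.pi * math.sqrt(x)))

def total_charge(z, n_points=20000):
    """Integrate f over [0, L(z)] with the midpoint rule in phi, where x = L sin^2(phi)."""
    length = box_length(z)
    d_phi = 0.5 * math.pi / n_points
    total = 0.0
    for k in range(n_points):
        phi = (k + 0.5) * d_phi
        x = length * math.sin(phi) ** 2
        jacobian = 2.0 * length * math.sin(phi) * math.cos(phi)   # dx / dphi
        total += density(x, z) * jacobian * d_phi
    return total

for z in (-1.0, 0.0, 0.5):
    print(f"z = {z:+.1f}   L(z) = {box_length(z):.6f}   integral of f = {total_charge(z):.8f}")
```

The integral should come out equal to one for each barrier position, which is the normalization condition that fixes $L(z)$ (equivalently, $3L^2 + 4zL - 8 = 0$).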
FIG. 2: The average density of states $f(x)$ plotted as a function of the shifted variable $x$ for $z = -1$ (dotted line), $z = 0$ (solid line), and $z = 0.5$ (dashed line).

FIG. 3: Monte Carlo computation of $\ln(Q_N(0))$ (points with error bars) along with a quadratic fit (solid line).



Note that the charge density $f(x)$ depends on $z$, i.e. the location of the barrier. A plot of this density for several values of $z$ is shown in Fig. 2.

A couple of remarks are in order: (i) The charge density $f(x)$ must be positive for all $x$ including $x = 0$. As $x \to 0$, $f(x)$ diverges as $x^{-1/2}$. However, in order that it remains positive, we need to ensure that the amplitude $L(z) + 2z \ge 0$ at $x = 0$ in Eq. (15). This condition, using $L(z)$ from Eq. (16), reads $\sqrt{z^2 + 6} + 2z \ge 0$ and requires $z \ge -\sqrt{2}$. Thus the results in Eqs. (15) and (16) are valid only for $z \ge -\sqrt{2}$. Indeed, this is expected because exactly at $z = -\sqrt{2}$, i.e., when the barrier is placed at the left edge of the Wigner sea, we recover from Eq. (15) the Wigner semi-circle law. For $z = -\sqrt{2}$, Eq. (16) gives $L = 2\sqrt{2}$ (the support of the semi-circle) and Eq. (15) gives $f(\mu) = \sqrt{2 - \mu^2}/\pi$ for $-\sqrt{2} \le \mu \le \sqrt{2}$. Thus, for any $z \le -\sqrt{2}$, our exact solution indicates that the charge density remains unchanged from the Wigner semi-circular form. Physically this means that if the wall is placed to the left of the lower edge of the Wigner sea, it has no effect on the charge distribution. (ii) The second remark is that the charge density $f(x)$ changes its shape in an interesting fashion as one changes the barrier location $z$ (see Fig. 2). It turns out that for any $z > -\sqrt{2}$, the charges always accumulate near the barrier at $x = 0$, leading to a square-root divergence $f(x) \sim x^{-1/2}$ as $x \to 0$. In particular, for $z = 0$, this accumulation of eigenvalues near $x = 0$ can be interpreted as the accumulation of massless modes in the context of a (stable) field theory, a fact that may be of relevance in anthropic principle based string theory.

FIG. 4: The analytic large-$N$ formula for $f$ with $z = 0$ (solid line) in Eq. (15) is compared to the numerically generated averaged histogram of $(6 \times 6)$ Gaussian matrices with positive eigenvalues. Despite the small size $N = 6$, the agreement is already fairly good, except near the large $\mu$ tail.

Knowing $f(x)$ exactly, the Lagrange multiplier is determined by setting $\mu = z$ in Eq. (12). This gives, following a shift in the integral, $C = -z^2/2 + \int_0^\infty dx'\,f(x')\ln(x')$. The saddle point action can now be evaluated explicitly [14]

$$S(z) = -\frac{1}{216}\left[72z^2 - 2z^4 + (30z + 2z^3)\sqrt{6 + z^2} + 27\left(3 + \ln(1296) - 4\ln\left(-z + \sqrt{6 + z^2}\right)\right)\right]. \tag{17}$$

The probability that all eigenvalues are to the right of $\zeta = z\sqrt{N}$ is then given by, to leading order in large $N$, using Eqs. (10) and (7),

$$Q_N(\zeta) \approx \exp\left[-\beta N^2\,\theta(z)\right] \tag{18}$$

where $\theta(z) = S(-\sqrt{2}) - S(z)$ and $S(z)$ is given by Eq. (17). Note that we have used $S(-\infty) = S(-\sqrt{2})$ following remark (i) above. The result in Eq. (4) can then be derived by setting $t = -\zeta = -z\sqrt{N}$, and one finds the large deviation function for $y > 0$, $\Phi(y) = S(-\sqrt{2}) - S(-\sqrt{2} + y)$. For small $y$, $\Phi(y) \approx y^3/(6\sqrt{2})$ and for large $y$, $\Phi(y) \approx y^2/2$. Thus for $\sqrt{2N} - t \ll \sqrt{N}$, using $\Phi(y) \approx y^3/(6\sqrt{2})$, we get

$$\mathrm{Prob}[\lambda_{\max} \le t, N] \approx \exp\left[-\frac{\beta}{24}\left|\sqrt{2}\,N^{1/6}\left(t - \sqrt{2N}\right)\right|^3\right] \tag{19}$$

which matches exactly with the left tail of the Tracy-Widom distribution for all $\beta$. For example, for $\beta = 2$ one can easily verify this by comparing Eqs. (19) and (2).
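The closed form in Eq. (17) and the limits quoted above can be checked with a short script. This is a sketch with hypothetical function names. The overall minus sign in `saddle_action` follows the convention $Z_N \propto \exp[\beta N^2 S(z)]$ of Eq. (10), under which $\theta(z) = S(-\sqrt{2}) - S(z)$ is non-negative.

```python
import math

SQRT2 = math.sqrt(2.0)

def saddle_action(z):
    """S(z) of Eq. (17), in the convention Z_N ~ exp[beta N^2 S(z)]."""
    w = math.sqrt(6.0 + z * z)
    bracket = (72.0 * z**2 - 2.0 * z**4 + (30.0 * z + 2.0 * z**3) * w
               + 27.0 * (3.0 + math.log(1296.0) - 4.0 * math.log(-z + w)))
    return -bracket / 216.0

def theta(z):
    """theta(z) = S(-sqrt(2)) - S(z), meant for z >= -sqrt(2)."""
    return saddle_action(-SQRT2) - saddle_action(z)

def large_deviation(y):
    """Phi(y) = S(-sqrt(2)) - S(-sqrt(2) + y) for y > 0, and zero otherwise."""
    return theta(-SQRT2 + y) if y > 0.0 else 0.0

print("theta(0)                    :", theta(0.0))
print("(ln 3) / 4                  :", math.log(3.0) / 4.0)
print("Phi(y) / (y^3 / (6 sqrt 2)) :", large_deviation(0.01) / (0.01**3 / (6.0 * SQRT2)))
print("Phi(y) / (y^2 / 2)          :", large_deviation(1000.0) / (1000.0**2 / 2.0))
```

The first two printed numbers should agree, and the two ratios should be close to one at small and large $y$ respectively, approaching it more slowly at the large-$y$ end.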

The probability that all eigenvalues are positive is obtained by setting $z = 0$ in Eq. (18), resulting in the remarkably simple and exact formula stated in Eq. (3). The fact that this probability decreases as rapidly as $\sim \exp[-\beta\,\theta(0)N^2]$ for large $N$, and that there are significant $\sim O(N)$ corrections, indicates that numerically it is extremely difficult to measure the exponent $\theta(0)$ accurately. An attempt was made in Ref. [11] using GOE ($\beta = 1$) matrices up to sizes of $N = 7$ to fit the probability with the form $\exp[-aN^\alpha]$, which yielded $\alpha \approx 2.00387$ and $a \approx 0.3291$. Clearly, the system sizes are too small to take this fit seriously. It turns out that instead it is easier to evaluate $Q_N(0)$ directly via a clever Monte Carlo method that allows us to go up to $N \sim 30$ [14]. In Fig. 3 we show a plot of $\ln(Q_N(0))$ measured using this Monte Carlo method (for $\beta = 1$) with a fit of the form $aN^2 + bN + c$. This fit yields $a \approx -0.2755$, which is in good agreement with the exact value $\theta(0) = 0.274653\ldots$ predicted here.
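The fitting step can be illustrated on brute-force data. The sketch below is not the Monte Carlo method of Ref. [14], which is not described in the text. It assumes plain direct sampling of GOE matrices (unit-variance diagonal, variance-1/2 off-diagonal), which is only feasible for very small $N$, and then fits $\ln Q_N(0)$ to $aN^2 + bN + c$ with `numpy.polyfit`. Function and variable names are illustrative.

```python
import numpy as np

def log_prob_all_positive(n, n_samples, rng):
    """ln of the fraction of GOE samples (beta = 1) with all eigenvalues positive."""
    a = rng.standard_normal((n_samples, n, n))
    h = (a + np.swapaxes(a, 1, 2)) / 2.0
    hits = np.count_nonzero(np.linalg.eigvalsh(h)[:, 0] > 0.0)
    return np.log(hits / n_samples)

rng = np.random.default_rng(3)
sizes = np.array([1, 2, 3, 4])
log_q = np.array([log_prob_all_positive(n, 200_000, rng) for n in sizes])

# Quadratic fit ln Q_N(0) ~ a N^2 + b N + c; polyfit returns the highest power first.
a, b, c = np.polyfit(sizes, log_q, deg=2)
print("ln Q_N(0):", log_q)
print("fitted a :", a, "  compare with -(ln 3)/4 =", -np.log(3.0) / 4.0)
```

With so few and such small sizes the fitted coefficient is only indicative. Reaching sizes like $N \sim 30$ requires a method that does not rely on the rare event occurring by chance.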

Another numerical check consists in computing the charge density $f(\mu)$ by direct sampling of Gaussian matrices and comparing it to the theoretical prediction in Eq. (15). Here, we are clearly restricted to small values of $N$. In Fig. 4 we compare the numerically computed $f(\mu)$ for $z = 0$ obtained from matrices of size $(6 \times 6)$ with the theoretical prediction. Despite the small value of $N$, the agreement is already fairly good.
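An alternative to direct sampling of matrices, sketched here as an assumption rather than as the authors' procedure, is to sample the eigenvalues themselves from the joint pdf of Eq. (5) restricted to $\lambda_i > \zeta$, using single-charge Metropolis moves with a hard wall. For a wall at the origin, rescaling $\lambda_i \to s\lambda_i$ in the restricted partition function gives the exact sum rule $\langle\sum_i \lambda_i^2\rangle = N/\beta + N(N-1)/2$. This identity is not stated in the text and is used below only as a sanity check. All names and parameter values are illustrative.

```python
import math
import random

def metropolis_constrained_gas(n, beta, wall, n_sweeps, burn_in, step, seed=0):
    """Sample eigenvalues from the joint pdf of Eq. (5) restricted to lambda_i > wall."""
    rng = random.Random(seed)
    lam = [wall + 0.5 + i for i in range(n)]      # arbitrary, well separated start
    samples = []
    for sweep in range(n_sweeps):
        for i in range(n):
            old = lam[i]
            new = old + step * rng.uniform(-1.0, 1.0)
            if new <= wall:
                continue                          # hard wall: reject moves across the barrier
            if any(new == lam[j] for j in range(n) if j != i):
                continue                          # avoid log(0) for coincident charges
            # Change of beta*E, with E = (1/2) sum lam^2 - sum_{j<k} ln|lam_j - lam_k|.
            delta = 0.5 * beta * (new * new - old * old)
            for j in range(n):
                if j != i:
                    delta -= beta * (math.log(abs(new - lam[j])) - math.log(abs(old - lam[j])))
            if delta <= 0.0 or rng.random() < math.exp(-delta):
                lam[i] = new
        if sweep >= burn_in:
            samples.append(list(lam))
    return samples

n, beta = 6, 1.0
samples = metropolis_constrained_gas(n, beta, wall=0.0, n_sweeps=22000, burn_in=2000, step=0.6)

# Rescaled eigenvalues mu = lambda / sqrt(N); a histogram of these can be compared with Eq. (15).
mu = [lam / math.sqrt(n) for config in samples for lam in config]
mean_sq = sum(sum(l * l for l in config) for config in samples) / (n * len(samples))
print("smallest sampled mu     :", min(mu))
print("<lambda^2> per charge   :", mean_sq)
print("sum rule 1/beta+(N-1)/2 :", 1.0 / beta + (n - 1) / 2.0)
```

A histogram of `mu` can then be compared with $f(x)$ at $z = 0$. Deviations are expected near the upper end of the box, where finite-$N$ effects smear the sharp edge at $L(0)$.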

We thank O. Bohigas for useful comments. 